DIOPHANTINE PROPERTY IN THE GROUP OF 
AFFINE TRANSFORMATIONS OF THE LINE 



PETER P. VARJU 

Abstract. We investigate the Diophantine property of a pair of 
elements in the group of affine transformations of the line. We 
say that a pair of elements 71 , 72 in this group is Diophantine if 
there is a number A such that a product of length I of elements of 
the set {71, 72, 7i~ 1 , 72"*} is either the unit element or of distance 
at least A~ l from the unit element. We prove that the set of non- 
Diophantine pairs in a certain one parameter family is of Hausdorff 
dimension 0. 



1. Introduction 

Let G be a group endowed with a distance d(-, •). At this point, we 
make no assumption on d(-, •), but later we will specialize to concrete 
groups which come with natural metrics. We denote by 1 the unit 
element of G (as well as the unit element of any other multiplicative 
group). 

Let 71, 72 G G be two arbitrary elements and write 5* = {71, 72, 7f , r j 2 1 }- 
We denote the set of words of length at most / over S by 

W\ := {g G G : there is Si, . . . , Si £ S such that g — s± ■ ■ • si}. 

In this paper we study the behavior of 

di := min d(q, 1) 

as / increases. This, off course, depends on the properties of G and the 
generators. Following Gamburd, Jakobson and Sarnak we say that 
the pair 71,72 is Diophantine, if there is a number (3 > independent 
of / such that 

di > Wl-P. 
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Suppose that G = (R, +) and 71,72 is mapped to the real numbers 
x, y by this isomorphism. Then the behavior of d\ depends on the 
distance of x/y to rational numbers of a given denominator. This is 
a classical topic and much is known about it. It is well-known that 
for Lebesgue almost every generators, d\ > Cl~ x ~ e for every e with 
some constant depending on e and x, y. We will outline a proof for this 
fact later. In addition, when x/y is an algebraic numbers the the same 
inequality holds. This latter claim is equivalent to a deep result of Roth 
||. Thus, almost every pairs 71,72 G G = (R, +) are Diophantine and 
as well as those that are mapped to algebraic numbers. 

On the other hand, for a generic pair 71,72 in the sense of Baire 
category, we have d\ < f(l) infinitely often with an arbitrarily fast 
decreasing positive function /. Therefore, generic pairs in the sense of 
Baire category are not Diophantine. 

Now we turn to non-commutative groups. Gamburd, Jakobson and 
Sarnak |4j] noticed that every algebraic 71,72 G G = SU(2) are Dio- 
phantine, and also that this is not the case for generic pairs in the 
sense of Baire category. They posed the problem (see ||, Problem (4) 
at the end of Section 1]) whether almost every pair possesses the Dio- 
phantine property. This problem is of great interest due to a result 
of Bourgain and Gamburd @. This asserts that the random walk on 
SU(2) generated by 71, 72 is mixing exponentially fast, if the pair 71, 72 
is Diophantine. See also their paper ||, where the result is extended 
toSU(d). 

Kaloshin and Rodnianski 0, Theorem 1] proved that for almost every 
pair 71,72 G G = SO(3), there is a number D > 1 such that di > 
D~ l . This bound is weaker than what is required in the definition of 
Diophantine property. 

The problem has recently been solved in nilpotent Lie groups by 
Aka, Saxce and Rosenzweig |] 

The purpose of the paper is to study the problem of Diophantine 
property in a solvable Lie group. More specifically, we consider the 
group of affine transformations of the line: 



We further restrict generality and consider pairs only of the form 



(1) 




(2) 



7i = (o !)' 72= (J 1) 



DIOPHANTINE PROPERTY 



3 



This is indeed a significant loss of generality, but we note that an arbi- 
trary pair can be conjugated into this form, if one of them is unipotent, 
i.e all of its eigenvalues are 1. 

The result of the paper is the following: 

Theorem 1. The set of complex numbers \x\ > 1 such that the pair 
(|2|) is not Diophantine is of Hausdorff dimension 0. 

Since (§) generates the same group if we replace x by x -1 , the theo- 
rem implies the same conclusion for parameters \x\ < 1 . On the other 
hand we do not claim anything for parameters on the unit circle. 

Acknowledgment. I am greatly indebted to Menny Aka and Em- 
manuel Breuillard for communicating the problem to me. I thank them 
and Elon Lindenstrauss, Lior Rosenzweig, and Nicolas de Saxce for 
stimulating discussions about this project. 

2. Outline 

Before discussing the proof of Theorem HJ, we briefly review the 
Abelian case. This argument is folklore. Assume that G = (R, +) 
and the pair 71,72 is mapped into x,y. In fact, we can choose the 
isomorphism in such a way that \x\ < y = 1. This changes the distance 
function we consider on G, but only by a constant factor. 

Observe that any element of W\ is of the form mx+n, where m, n G Z 
and \m\ + \n\ < I. Fix a number A > 0. If \mx + n| < l~ A , then 
\x + n/m\ < m~ x l~ A < m~( 1+A \ Thus if d[ < l~ A for infinitely many 
/, then x belongs to the set 



00 00 m 

^ : =nU U \--m-^ A \- + m 

1 1 v_y v-y Lm m 



fc=l m=k n=—m 



where [a, b] C R denotes the closed interval between a and b. 

Let a > be a number. Then the a-dimensional Hausdorff measure 
of Qa is at most 



lim J2(2m + 1) (2m-( 1+A )) Q = 



if a > 2/(1 + A). Thus the Hausdorff dimension of Qa is at most 
2/(1 + A). This shows that the set of pairs of generators 71, 72 G G = 
(R, +) that are not Diophantine is of Hausdorff dimension 0. 

Our proof of Theorem p] follows similar lines. First, we need to 
understand how elements of Wi look like. This is achieved in the next 
lemma. 
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Lemma 2. Let G be as in ([!]) and 71,72 £ G of the form (fj). Then 
any element of W\ is of the form 

^ / x k x~ n P(x) 

where —l<k<l and < n < I are integers and P(x) = ao + a\x + 
. . . + a m x m is a polynomial with integer coefficients of degree at most 
I + n and YllLo \ a i\ — I- 

Although elements of form (|3p are not all contained in Wi, they are 
easily seen to be contained in Wm. This implies, in particular, that 
\Wi\ grows exponentially. 

Proof. The proof is by induction. The statement is easy for I = 0. 
Multiplying an element of the form (0) by a generator from the left, 
we either increase or decrease the value of A; by 1, or add ±x k to 

x-' n P{x). □ 

If an element of W\ differs from the unit element in the top left entry, 
then the difference is at least 1 — \x\~ l (recall that |x| > 1), which is 
a bound independent of /. Hence it is enough to consider elements 
with k = and hence P(x) ^ in @. To implement an argument 
similar to the Abelian case, we need to understand the set, where a 
polynomial satisfying the properties in Lemma § is small. This was 
off course simpler in the Abelian case, since there we only needed to 
consider polynomials of degree 1. 

In a semi-simple group like SU(2) or SL 2 (R), the elements of W[ can 
be described in a similar fashion, but the polynomials which occur can 
have exponentially large coefficients. Then the problem becomes even 
more difficult. This was considered by Kaloshin and Rodnianski 0, 
and they gave a bound of the form di > A~ l . The reason why we are 
able to give a stronger bound in the case we consider is that we have 
a better estimate on the coefficients and this allows us to control the 
distance between the roots. 

We write 

21 21 
Vi := {P{x) = '^^a i x t : Oj 6 Z and \a^\ < I}. 

i=0 i=0 

This is the set of polynomials that can occur in (^|). Our aim thus is to 
understand the set, where an element of V\ is small. This set is easily 
understood in terms of the roots of the polynomial. Indeed, by the 
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fundamental theorem of algebra, we can write 

m 

P(x) = a m - Zi), 
i=i 

where a m is the highest order non-zero coefficient and z±, . . . ,z m are 
the roots of P(x). 

We see from this formula, that P(x) is small only in the proximity of 
the roots. If the roots are "far from each other" , then P(x) < A~ l for a 
suitably large number A will hold only on I very small regions around 
the roots. The diameter of the regions is approximately (A') , where 
A' is another number depending on what "far from each other" means. 
On the other hand, if there are k roots "very close to each other", then 
P(x) < A~ l may hold on a region of diameter (A')~ l / k . 

Thus we need to show that a typical polynomial does not have many 
roots near each other, and we need to estimate the number of excep- 
tions. We are able to do this only near points which are bounded away 
from the unit circle. We choose a parameter r > and prove that 
P can have at most Clog I roots "near" to a point x G C satisfying 
\x\ > 1 + r and if it has "many" such roots, then it has a "large" coef- 
ficient. (C is a number depending on r.) In fact, we prove much more, 
because we only use that the roots close to x are outside the circle of 
radius 1 + l/2r around the origin. The precise statement will be given 
later in Lemma |], which is a simple application of Jensen's formula. 

In Section [T2] we decompose the annulus {x : 1 + 1/r < \x\ < r -1 } 
into many small regions, and we consider those elements of V\ which 
have many roots close to each other within a single region. As it 
turns out, the difference of two such polynomials also have many roots 
within the same region. Then by Lemma [|, we can conclude that the 
coefficients of the two polynomials must be far away in the l°° norm. 
Finally, in Section |3.3| we estimate the number of Z°°-separated points 
in an / 1 -ball of Z 2 ' +1 . Owing to the definition, the coefficients of a 
polynomial in Vi are bounded in I 1 norm, hence we find an estimate on 
the number of polynomials which have many roots close to each other 
in the annulus {x : 1 + 1/r < \x\ < r -1 }. 

The details of this argument is given in Section |^, where we prove 
the following proposition. 

Proposition 3. For any numbers r > 0, a > 1 there are numbers 
A, C > 1 such that the following holds. For an integer k > 1, denote 
by Qi,k C V\ the set of polynomials P(x) for which the set 

ft pl ■= [ x : \P{x)\ < A~ l and 1 + 1/r < \x\ < r -1 } 



DIOPHANTINE PROPERTY 



6 



can not be covered by 21 disks of radii 2~ al l k . Then for every integer 
k > 1, we have \ Qi t k\ < C ■ 10'/ fc . Moreover, fork > log/, contains 
only the polynomial. 

It is easy to deduce Theorem p] from Proposition |^ and Lemma |2|. 
Indeed, by Lemma |2], it follows that if 71,72 given by (0) is not Dio- 
phantine, then 

00 00 

x e n := p| |J (J Q P)l , 

71=1 l=n PeVi 

provided r is sufficiently small, so that 1 + 1/r < \x\ < r _1 . 

If P ^ Qk,i, then we can cover ilpj by 21 disks of radii 2~ al l k . Thus 
we see that the a dimensional Hausdorff measure of Q is at most 

[log*] 

(4) lim Y] ( \Vi\ ■ 21 ■ 2~ aal + ■ 2/ ■ 2- aal/{k+1) 

k=l 




We have 



Wi\ < 2 2l+1 ■ I , < 100 z . 



Here, the power of 2 stands for the possible choices of the sign and the 
binomial coefficient is the number of ways we can write I as the sum 
of 21 + 1 non-negative numbers taking into account the order of the 
terms. In addition, by Proposition |3], we have 

|Qz,fc| <C-10 l/k <C-100 l/(k+1 \ 
Now (f|) is bounded above by 

00 [log I] 

lim y y 2ci ■ i m i/(k+i) 2 - a ai/ { k + i)^ 

l=n k=0 

This limit is easily seen converging to 0, if we choose a sufficiently large 
so that 

2 aa > 100. 

This shows that the set of numbers x within the annulus {x G C : 1 + 
r < |x| < r -1 } such that the pair 71, 72 given by (0) is not Diophantine 
has Hausdorff dimension at most a. We can take r — > and a — > 0. 
Therefore Theorem [I] will be proved once we verify Proposition |3|. 
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3. Bounding the number of exceptional polynomials 

3.1. We begin by giving the Lemma, which bounds the number of 
large roots of a polynomial in terms of the maximum of the coefficients. 
Note that the bound does not depend on the degree of the polynomial. 

Lemma 4. For every number r > 0, there is a number C r such that 
the following is true. If 

m 

P(x) = a m x m + . . . aix + a = a m Y\_( x ~ z i) 



i=l 



is a polynomial with integer coefficients, then 

\{i:\zi\>l + r/2}\ < C r [log( max |oi|) + 1]. 

0<i<m 

Proof We apply a version of Jensen's formula on the circle \x\ — p 
(1 + r/2) 1 ' 2 . We obtain 



log 



i=0 



> — ! 

2vrp J\ x \ =p 



log \ P(x)\dx 



= ~ / log | O,,, | + J~]l0g\x - Zi\dx 

2^rp J\ X \= P ti 

m 1 f 

= log\a m \ + y^ - — / log \x - Zi\dx 

ti 2n p J\x\=p 

= log|a m |+ Y lo S I + logp - 

i:\zi\>p i-\zi\<p 

Taking exponentials and dividing both sides by p m , we get 



zw m > n 

i=0 i:\z t \>p 



\Zj\ 
P 



> p \{i:\ Zi \>l+r/2}\_ 



(Recall that p 2 = 1 + r/2.) 
Finally, we observe that 



E 



di\p 



< 



P 



max a,- . 



p — 1 0<i<m 



Combining this with the previous inequality and taking logarithms we 
deduce the claim. □ 
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3.2. We fix some real numbers r > 0, a > 1, A and B whose value will 
be determined later. We write the annulus as a union of small regions: 

(5) X := {x G C : 1 + 1/r < \x\ < r~ 1 } = X l U X 2 U . . . U X N 

in such a way that the diameter of each is at most 2~'/ /c , each of 
them contains a disk of radius dl~ l l k and the number of the regions 
is N < CA [ l k . Here and everywhere below, C and c denote numbers 
which may depend only on r and a and their values may change from 
line to line. 
We write 

Qi,k,i ■= {PCPr- \P(x)\ < B ~ l for all x G X,}. 

In the next lemma we exploit the idea that an element of Qi must have 
approximately at least k zeros near each other within the annulus X. 
Hence it must be fairly small on one of the regions X^. Note however 
that the Xi are significantly larger than the disks of radius 2~ al l k that 
fail to cover the set where to polynomial is below A~ l . Thus we have to 
choose B significantly smaller than A in order that the lemma holds. 

Lemma 5. Suppose that B < c ■ A x l a for some suitably small number 
c depending on r. Then 

Qi,k c Qi, k ,i u . . . u Qi AN . 

Proof. Let P G Q tjk and let 

m 

P(x) = a m Y[(x - Zi). 
i=i 

By the definition of Qi :k , the disks of radii 2~ a[ l k around the roots do 
not cover the set where P is below A~ l . Thus there is a point x$ G X 
such that |P(xo)| < A' 1 and \x — Zi\ > 2~ al l k for all 1 < i < m. 

There is a region Xj such that Xo G Xj. We show that P G Qi,k,j, 
that is P(x) < B~ l for all x G Xj. For each root z i: we have 

|^ — %i\ < \ x o — z i \ + \x — x\ < \xq — Zi\ + 2~ l l k . 

Since \x — Zi\ > 2~ al/k , 

\X - Z { \ < \X - Zi\ + \x - Zi\ 1/a . 

If |xo — Zi\ < 1, then \x — z,i\ < 2\x — Zi\ l l a . In the opposite case 

\x — Zi\ < 2\x — Zi\ < 2(\xo\ + \zi\) < 4r _1 max{l, \zi\}. 

Thus, in either case we have 

\x — Zi\ < 4r _1 max{l, | ^ j } j rco — Zi\ l ^ a . 
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Therefore 
\P(x)\ = 



a„ 



n 



jQ[4r 1 max{l, \z{ 



Fo 



By Jensen's formula 

m 

la. 



i=l 



1 /" 

1 T max{l, \zi\} = exp — / \og\P(x)\dx 



< I. 



Thus 



\P(x)\ < l(4r- L ) m \P(x )\ 



I l/o 



< 



A l/a 



since m < 21. This gives |_P(x) | < B provided 

c < 

This can be bounded by a number depending only on r. 



□ 



3.3. In this section, we give an estimate on the size of Qi,k,i- Multi- 
plying this by the number of the regions, we will conclude Proposition 
[3| Taking the difference of two elements in Qi k,i, we g e t a polynomial 
which is very small on the region Xj. This requires that it has many 
zeros near this region and by Lemma ^, it must have a large coefficient. 
This is the content of the next lemma. 

Lemma 6. Suppose that B is sufficiently large depending on a and 
r. Let P Q e Qi^,i- Then the polynomial P — Q has a coefficient 
greater than e 10h . 

Proof. Let 

P(x) - Q(x) = a m (x - z x ) ■ ■ ■ (x - z m ), 
and denote by M the number of roots Z{ of modulus larger than l + r/2. 
Recall that Xi contains a disk of radius c-2~ l l k . This implies that there 
is a point x G Xj such that | x — ^ | > c ■ 2~ [ l k / \J~M . Moreover, if 
\ z i\ < 1 + r/2, then \xq — Zi\ > r/2. 



Thus 



2B- l >\P(x )-Q(x )\>(r/2y 



2~l/k 



M 



By Lemma |], we see that M < yl, otherwise P — Q would have 
coefficients larger than ^/Cr-i^ w hich is impossible if I is sufficiently 
large. Hence 



M 
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for some constant C depending only on r and a. Combining with the 
previous inequality, we get B/C < 2 A///fc , hence 

log B- log C 
log 2 

We combine this bound with Lemma ^ and conclude that P — Q 
must have a coefficient larger than 



logB-logC ,._ | 

J 



11, 1 

e cvio g 2 



which implies the claim for B sufficiently large. □ 

Proof of Proposition [|. First choose B sufficiently large so that the 
conditions of Lemma || hold and then choose A sufficiently large so 
that the conditions of Lemma |5| hold, as well. 

Applying Lemma [|for Q = and any 0/Pg Qi,k,i, it follows that 
P must have a coefficient of size larger than e 10k . This is only possible 
if e l0k < I that is k < (log Z)/10. 

We give an estimate on the size of Qi t k,i- Denote by \_x] the nearest 
integer to the real number x. (When x is half plus an integer, we define 
\_x] = x — 1/2.) Write K = e 10fc . By Lemma |6], the map 

$ : a 2i x 21 + . . . + a x x + a ^ ( [a 2 i/K] , . . . , [aJlC], [a /K] ) 

is injective from Qi^,i to Z 2l+1 . 

For v = {p2u ■ ■ ■ ,b ) G Z 2l+1 we write 

\\v\\i = \b 2 i\ + . . . + |&o|- 

Observe that |L^/i^]| < 2\x\/K for all real numbers x. Hence, for 
every P E Qi t k,i C we have 

\\${P)\\ x <2l/K. 

The number of ways of writing [21 / K] as the sum of at most 21 + 1 
non-negative integers (taking the order into account) can be estimated 
using Stirling's formula, as follows: 



f[2l/K] + 2l\ {21/ K + 2 /) 2 '/™+i/2 

\ 21 ) ~ 1 °(2l) 2l + 1 / 2 (2l/K) 2l / K + 1 / 2 



/ 1 \2i+l/2 

< 10fl + ^J ■(l + K) 2l l K 

< 10 exp[(2Z + 1/2) /K + 21 \og(K + 1)/K]. 
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For each such sum, the number of non-zero terms is at most [21 /K], 
hence there are at most 2 2l l K possibilities for the signs of the coordi- 
nates of $(-P) for any P e Qi k,i- This finally gives 

I Qi.fc.il < exp[2l/K + 21/ K + 21 \og{K + l)/K}. 
Recall that K = e 10fc . It is an easy calculation to check that 

21og(AT + l) + 4 J_ 
K ~ 2k' 

Thus 

\Qi,kA<e l/2k - 

Recall that the number of regions is iV < C ■ 4 i//fc , hence by Lemma 
|] we have 

\Qi,k\ < Ce l/2k 4 l/k <C-10 l/k , 
which was to be proved. □ 
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